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Xylanase, oligonucleotidic sequence 
encoding it and its uses 

The present invention relates to a xylanase and to a nucleotidic 
sequence encoding it. 
5 It also relates to the use of this enzyme in the bleaching of 

paper pulp and the preparation of xylose or of xylo-oligosaccharides 
from plant raw materials, in particular. 

Varied uses have been proposed for xylanases in the 
biotechnology field, especially in the foodstuffs field (Biely, Trends 

10 Biotechnol 3 (11): 286-290, 1985), in the paper industry (Mora et al., J. 
Wood Chem. Technol, 6: 147-165, 1986) or in the production of 
chemical compounds from hemicellulose (Reilly P. J., 1981, Xylanases: 
structure and function in trends in the biology of fermentations for fuels 
and chemicals. A.J. Hollaender (Ed), Plenum, New York). 

15 The technical feasibility of such applications has been 

assessed chiefly using enzymes produced by mesophilic fungi. However, 
such applications could be facilitated by the use of fungi possessing 
better temperature stability. 

Various bacteria and enzymes are known for the production of 

20 xylanases ( see, in particular, Wong et al., Microbiological Reviews, 52, 
No 3 305 317, 1988). Hitherto, the highest yields of enzymes have been 
obtained from fungi (Yu et al. Enzymo Microb. Technol. 9: 16-24, 1987). 
However, hyperproductive strains of Bacillus have already been 
described (Okazaki et al. Appl. Microbiol. Biotechnology, 19: 335-340, 

25 1984; Okazaki et al., Agric. Biol. Chem. 49, 2033-2039, 1985). Such 
thermophilic species of Bacillus which degrade xylan can be good 
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candidates for the industrial production of xylanases on account of their 
high growth rate and of their genetics being well known. 

The xylanases isolated by Okazaki et al. emanate from two 
Bacillus strains referred to as W1 and W2 by the authors. In each of 
5 these strains, two components of the xylanase activity, referred to as I 
and II, have been demonstrated. The components I degrade xylan to 
xylobiose and to oligomers having a higher degree of polymerization, 
while the components II produce xylose in addition to the above 
compounds. 

io The components I (referred to as W1.I and W2.I) have 

respective molecular weights of 21.5 kDa and 22.5 kDa, as well as 
isoelectric points of 8.5 and 8.3. The components II (W1.ll and W2.ll) 
have, for their part, respective molecular weights of 49.5 kDa and 50 
kDa. 

15 The two components I and II are inhibited by Hg" + ions and, to 

a lesser extent, by Cu'* + . 

Many other xylanases have been isolated from various species 

of Bacillus, Clostridium, Aspergillus, Streptomyces or Trichoderma, inter 

alia (Wong et al., 1988, cited above). 
20 Thus, the resume of Japanese Patent JP 130 96 84 

(RIKAGAKU KENKYSHO) relates to a type Wll xylanase having a 

molecular weight of 50 kD or 42 kD. No isoelectric point is mentioned for 

this xylanase. 

A paper by RAJARAM et al., ( Applied Microbiology and 
25 Biotechnology, Vol. 34, n°1, October 1990, pages 141-144) relates to a 
Bacillus strain isolated in the natural environment and which produces a 
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xylanase having optimal activity at between 60°C and 70°C and at a pH 
of between 6 and 7. This enzyme is characterized neither by its 
molecular weight nor by its isoelectric point. This strain produces, in 
addition, other enzymes such as cellulases. 
5 Another resume of a Japanese patent in the name of 

RIKAGAKU KENKYUSHO (JP-85 118 644) describes a xylanase having 
optimal activity at a pH of between 6 and 7. This enzyme is considered 
to have a molecular weight, determined by ultrafiltration, of between 50 
and 100 kD. No isoelectric point is mentioned in this resume. 

10 A paper by GRONINGER et al. (Enzyme Microbiology and 

Technology, Vol. 8, May 1986, pages 309-314) relates to a Bacillus 
stearothermoplilus strain isolated from mud and which produces a heat- 
stable xylanase. The enzyme is characterised as having optimal activity 
at 78°C and at a pH value of 7.5. This enzyme is characterized neither 

15 by its molecular weight nor by its isoelectric pH. 

The industrial production of xylanases is impeded by the 
simultaneous presence of contaminant activities such as cellulases, 
leading to additional purification costs. 

As far as the Applicant is aware, the best productivity with 

20 respect to endoxylanase obtained with a microorganism not producing 
cellulase has been obtained from a Streptomyces lividans mutant devoid 
of cellulase activity after introduction of a plasmid carrying genes coding 
for xylanases A and B. Productivities of the order of 6000 to 10,000 
lU.I/h were observed in the culture media. It should nevertheless be 

25 noted that, in this case, problems linked to the failure of xylanase A to 
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hydrolyse insoluble xylan, and of thermal stability in the case of xylanase 
B, were encountered (Kluepfel et al. Biochem. J. 267, 47-50, 1990). 

None of these enzymes hence possessed, as far as the 
Applicant is aware, features making an industrial application possible, 
5 that is to say good thermal stability, a large capacity for degradation of 
substrates, a means of production by hyperproductive strains and 
possibilities to modify the aminoacids sequence. 

Another xylanase has been isolated from a Bacillus strain 
deposited at the CNCM Culture Collection under the number 1-1017. It 
10 has been described in EP 0.573.536 application filed under the name of 
the present applicant. This xylanase displays a temperature stability. 
However its sequence has not been determined and it can only be 
produced by growing the said Bacillus strain. 

Thus it was not possible to modify its protein sequence in the 
15 aim of improving its properties. 

The applicant has thus cloned the gene encoding this xylanase 
and has sequenced it. 

The subject of the present invention is, thus a thermophilic 
xylanase having a sequence sharing an homology of at least 80%, 
20 preferentially 90%, and more preferentially 95%, with the following one 
(SEQ ID N°2): 

Asn Thr Tyr Tip Gin Tyr Trp Thr Asp Gly He Gly Tyr Val Asn Ala Thr 
Asn Gly Gin Gly Gly Asn Tyr Ser Val Ser Trp Ser Asn Ser Gly Asn 
25 Phe Val He Gly Lys Gly Trp Gin Tyr Gly Ala His Asn Arg Val Val Asn 
Tyr Asn Ala Gly Ala Trp Gin Pro Asn Gly Asn Ala Tyr Leu Thr Leu 
Tyr Gly Trp Thr Arg Asn Pro Leu He Glu Tyr Tyr Val Val Asp Ser Trp 
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Gly Ser Tyr Arg Pro Thr Gly Asp Tyr Arg Gly Ser Val Tyr Ser Asp Gly 
Ala Trp Tyr Asp Leu Tyr His Ser Trp Arg Tyr Asn Ala Pro Ser He Asp 
Gly Thr Gin Thr Phe Gin Gin Tyr Trp Ser Val Arg Gin Gin Lys Arg 
Pro Thr Gly Ser Asn Val Ser He Thr Phe Glu Asn His Val Asn Ala Trp 
5 Gly Ala Ala Gly Met Pro Met Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala 
Thr Glu Gly Tyr Tyr Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

The degree of homology can be determined using pairwise 

alignment methods such as the GAP and the BESTFIT programs of the 

Genetics Computer Group, Inc. Package (GCG) Fast database 
10 searching programs such asd FASTA and BLAST (included in the GCG 

package) can be used for the comparison of a sequence to all available 

sequences of a database. 

For the definition of the term "homology 11 one can refer to 

Altschul et al. (1990, J. Mol. Biol. 215, 403-410) and Doolittle R.F. (Ed) 
15 (Molecular evolution: computer analysis of protein and nucleic acid 

sequences. Methods in Enzymology 183, Academic Press, London, 

1990). 

Xylanases falling under this definition are in particular the ones 
in which one or a few aminoacids have been changed, compared to the 
20 sequence SEQ ID N°2. 

Such changes in the aminoacids are preferentially the ones 
which consist in the substitution of one aminoacid by another one which 
has substantially the same properties such as defined by Lehninger 
(page 70, 2nd french edition, 1979, Flammarion ed.) or in its more 
25 recent re-edition. It is reminded that the twenty basic aminoacids are 
classified in four groups depending on their properties: 

- the ones having a hydrophobic or non-polar lateral chain, 
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- the ones having a polar lateral chain not charged, 

- the ones having a negatively charged lateral chain, and 

- the ones having a positively charged lateral chain. 

Such a xylanase can possess a molecular mass of 
5 approximately 22 kDa, determined by SDS-PAGE, or 20.7 kDa 
determined by mass spectroscopy and an isoelectric point of 
approximately 7.7. 

This enzyme advantageously displays great stability at 60°C 
for at least 24 hours, and a pH of optimal activity within the range 
10 extending from 4.8 to 7, and preferably approximately 6. 

It should be noted that the pHj of this enzyme is fairly high but 
nevertheless lower than the pHj of the xylanases of similar molecular 
mass produced by some bacilli, in particular those described by Okazaki 
et al. (1985, publication cited above). 
15 pH 6 corresponds to an optimal pH, but the activity remains 

greater than 80% in the range between 4.8 and 7. 

Another subject of the present invention is a nucleotidic 
sequence coding for the said xylanase. This sequence can be DNA or 
RNA sequence and in particular c DNA, plasmidic DNA, genomic DNA or 
20 m RNA 

Preferentially such a nucleotidic sequence is the following one 
(SEQ ID N°1): 

aacacgtactggcagtattggacggatggcatcgggtatgtgaacgcgacgaacggaca 
25 aggcggcaactacagcgtaagctggagcaacagcggcaacttcgtcatcggcaagggct 
ggcaatacggtgcgcacaaccgggttgtcaactacaacgccggcgcatggcagccgaa 
cggcaacgcgtatctgacgctgtacggctggacgcgcaacccgctcatcgaatactacgt 
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cgtcgacagctggggcagctaccgcccgaccggcgactaccggggcagcgtgtacagc 

gacggcgcatggtatgacctctatcacagctggcgctacaacgcaccgtccatcgacggc 

acgcagacgttccaacaatactggagcgttcgtcagcagaaacgcccgacgggcagcaa 

cgtctccatcacgttcgagaaccacgtgaacgcatggggcgctgccggcatgccgatgg 

gcagcagctggtcttaccaggtgctcgcaaccgaaggctattacagcagcggatactcca 

acgtcacggtttggtaa 

The xylanase according to the present invention can thus also 
be produced by a microorganism strain, appropriately chosen, 
transformed by a vector coding for the said xylanase. The said 
microorganism is grown in an appropriate medium and thereafter the 
xylanase is isolated as described below. 

Such a microorganism is chosen in order to be able to produce 
and to excrete it. 

It can be a bacteria such as Escherichia coli or Bacillus sp. 

The vector coding for the xylanase is chosen in order to be 
expressed in the said microorganism. It can be a plasmid, such as 
pBluescript or preferentially pET.. 

Systems of expression suitable for the production of the 
xylanase according to the present invention are in particular listed in 
D.V. Goeddel ((Ed). Gene expression technology. Methods in 
Enzymology, 185, Academic Press, London, 1990). 

The pET E.coli expression system is one of the most widely 
used bacterial expression system (Studier et al., 1990, Meth. Enzymol., 
185, 60-89). 

The expression of recombinant xylanase can be achieved in 
particular as following. The DNA fragment encoding the mature 
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xylanaxe, i.e. the sequence SEQ ID N°1, is engineered by PCR so as to 
generate Ndel and Bamhl terminal restriction sites suitable for 
expression in the T7-based vector pET3a. The PCR fragment is cloned 
blunt-ended into pBluescript (Stratagene Cloning Systems) before 

5 cloning as a Ndel/BamHI fragment into pET3a. 

The recombinant enzyme is expressed from pET3a in the E. 
coli strain BL21 (DE3) carrying pLysS. Cultures are grown in L-broth 
containing ampicillin (100 pg/ml) and chloramphenicol (25 pg/ml) until an 
Aeoo °f 3 was reached, before induction with 0.1 mM isopropyl I5-D- 

10 thiogalactoside (IPTG) for 3 hours. Large-scale cultures for protein 
purification are centrifuged and the cells are lysed in a buffer containing 
50 mM Tris-HCI, pH 8.0, 1 mM EDTA by passage through a French 
press (10 MPa). The same process described in EP 0.573.536 for 
purifying the xylanase from the culture supernatant of the Bacillus can be 

15 used. One can expect about 1 mg of recombinant protein per ml of cell 
culture. 

An advantage of this way of production of the xylanase is that 
the nucleoditic sequence can be mutated before to be introduced in the 
microorganism. It is therefore easy to obtain various mutations 
20 corresponding to xylanases having various sequences. 

This was not possible with the thermostable xylanases already 
described in the prior art, such as the one described by GRUNINGER et 
al. (previously cited), since their sequences were not known. 

For carrying out the present invention, in particular this way of 
25 production, the man skilled in the art can refer to the following manual 
which describes the usual techniques of molecular biology: Maniatis et 
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al. 1982- Molecular Cloning: a Laboratory Manual, Cold Spring Harbor 
Ed. New York, or one of its more recent editions. 

The xylanase as described above can be obtained through a 
process, comprising the following steps: 
5 - concentration of the microorganism culture supernatant, 

- passage through an ion exchange column such as a column 
of Q Sepharose Fast Flow (Pharmacia), 

- passage through a hydrophobic interaction column such as a 
column of Phenyl-Sepharose (Pharmacia). 

10 Concentration of the supernatant can , in particular, be 

performed by ultrafiltration through a polysulphone membrane having an 
exclusion threshold above 10 kDa. 

This process enables a substantially pure xylanase preparation 
to be obtained. 

15 The xylanase described above can be produced through a 

process comprising the steps: 

- of growth of the bacteria in a medium containing a growth 
substrate such as glucose, and 

- of production of xylanase induced by feeding the culture 
20 continuously with suitable amounts of xylo-oligosaccharides. 

The subject of the present invention is also the use of the 
xylanase described above in the bleaching of paper pulp. 

An advantage of this xylanase lies in the fact that the degree of 
hydration of the paper pulp is of little importance. It is not obligatory to 
25 dilute the pulp greatly in order to obtain good enzymatic attack. The use 
of this xylanase as an auxiliary in the bleaching of paper pulp is all the 
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more advantageous for the fact that the preparations are devoid of 

cellulase contaminants. 

This xylanase may also be used for the preparation of xylose or 

of xylo-oligosaccharides from raw materials of plant origin, which are 
5 inexpensive and renewable raw materials (for example maize cobs). 

Other uses of xylanases have been mentioned in the literature. 

The review by Zeikus et al. (Thermostable saccharidases New Sources 

uses and Biodesigns in "Enzymes in biomass conversion", Leatham and 

Himmel, ACS Washington D.C. , 1991) lists the main uses of xylanases. 
10 They are mainly used in food manufacture, where their properties enable 

bread-making, the clarification of fruit juices and wines and the 

nutritional qualities of cereal fibres to be improved, and in the production 

of thickeners for foodstuffs. 

The second sphere of application relates to the paper pulp and 
15 fibre industries, where they are used for the bleaching of pulps, the 

manufacture of wood pulp and the purification of fibres for rayon 

manufacture. 

Uses are also noted in poultry feeding, in which uses xylanases 
are employed in order to decrease the viscosity of the feeds (Van 
20 Paridon et al. Xylans and Xylanases, International Symposium, 
Wageningen, 8-11 December 1991; Bedford and Classen H.L Xylans 
and Xylanases, International Symposium, Wageningen, 8-1 1 December 
1991). 

The use of xylanase in enhancing the value of by-products of 
25 the paper pulp industry is more specifically mentioned in the paper by 
Biely (Trends in Biotechnology, Vol.3, No.11, 1985). 
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Mention may also be made of the two European Patents EP 
228,732 and EP 227,159, which relate to the use of Xylanases for 
improving the filterability of glucose syrup and of beer, respectively. 

The possibility of using the xylanases for the production of 
5 chemical compounds from hemicellulose (Reilly, cited above) will also be 
noted. 

These various publications show that the xylanases which are 
the subject of the present invention may be used in a large number of 
applications. 

10 The present invention is illustrated, without, however, being 

limited, by the application examples which follow. 

Fig. 1 illustrates the homology degree between the xylanase 
according to the present invention (XYL20) and other xylanases. 

Fig.2 represents HCA plots of four xylanases, including the one 
15 of the present invention. 

On fig.3 is indicated the prediction of secondary structural 
elements for the xylanase of the present invention. 
EXAMPLE 1: 

Cloning and sequencing of the gene encoding for the 
20 xylanase. 

1. Materials and methods . 

- Strains, vectors and culture conditions . 

The strain M017 was grown at 55°C in the liquid medium 
described in examples 1 and 2 of patent application EP-0.573.536. The 
25 SURE, XL1-Blue and XLOLR Escherichia coli strains, the vectors ZAP 
Express and pBluescript and the filamentous helper phage ExAssist™ 
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were all purchased from Stratagene Cloning Systems E. coli cells were 
grown in LB medium at 37°C. The medium was solidified by addition of 
1.5% (w/v) of Bacto-agar. 

- Preparation of DNA 

5 Bacterial genomic DNA was extracted from 1-1017 according to 

the method of Yang et al. (Appl. Environ. Microbiol, 1988, 54, 1023- 
1029). 

- Obtention of a partial genomic clone coding for the 

XYL2Q . 

10 PCR was used to amplify a region of the chromosomal DNA 

coding for the XYL20. The nucleotide sequence of the forward primer 
(P1) (SEQ ID N°3) was AAYACNTAYTGGCARTAYTGGACNGAYGG 
(derived from the sequence NTYWQYWTDG in the N-terminus end of 
the XYL20); that of the reverse primer (P2) (SEQ ID N°4) was 

15 YTG WC KN AC RCTC C A RTAYTG (corresponding to the sequence 
QYWSVRQ, a conserved region near the C-terminus of other xylanases 
from different Bacillus species). PCR was performed with chromosomal 
DNA as a template and the primers P1 and P2 on a thermocycler 
(Perkin-Elmer. France) with the following temperature profile: 1 min 94°C 

20 -1 min 50°C - 2 min 72°C for 35 cycles. The PCR product was purified 
on a 1 % agarose gel and was ligated into EcoRV-digested pBluescript. 
The chimaeric plasmid (pBX20) was used to transform SURE cells. 
Recombinant cells were selected on L-agar plates containing ampicillin 
(40 M9/ml), isopropyl-H-D-thiogalactoside (0.2 mM) and 5-bromo-4- 

25 chloro-3-indolyl-l3-D-galactoside (40 pg/ml). 
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- Construction of a B. sp 1-1017 genomic library in ZAP 
Express. 

Chromosomal DNA was partially digested with Sau3AI and the 
resulting DNA fragments in the size range 1.5-8 kb were purified and 
5 ligated into BamHI-digested ZAP Express. The library was constructed 
using XL1-Blue cells as indicated by the manufacturer. 

- Screening of the genomic library . 

pBX20 was digested with BamHI and Hindlll and the DNA 
insert was purified and labeled with digoxigenin (Boehringer Mannheim) 

10 following the instructions of the manufacturer. The labeled DNA was 
used to screen the genomic library. After the third screening, positive 
lambda plaques were isolated and the recombinant plasmid pBK-CMV 
inserted in the vector ZAP Express was excised using the filamentous 
phage ExAssist and then recovered by infecting the XLOLR cells in the 

15 presence of kanamycin (1 0 pg/ml). 

- DNA sequence analysis . 

Plasmid preparations for sequence determination were 
performed using Qiagen tip 100 (Diagen, Coger, France). Double- 
stranded DNA sequencing was done by the dideoxy chain termination 
20 method of Sanger et al (Proc. Nat. Acad. Sci. USA, 1977, 74, 5463- 
5467), using the Sequenase™ 2.0 DNA sequencing kit from United 
States Biochemical . Both universal and specific primers were used to 
sequence the sense and antisense strands of inserts in the plasmids. 

- Protein sequence analysis and Hydrophobic Cluster 
25 Analysis . 
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The sequence Analysis Software Package by Genetics 
Computer, Inc. (The GCG Package) was used throughout this work. In 
particular, multiple alignments were performed using the Pileup program 
and pairwise comparisons were done using the Bestfit program. 

5 Hydrophobic Cluster Analysis (HCA) is a method to compare 

amino acid sequence (Gaboriaud et al. FEBS Lett., 1987, 224, 149-155) 
which is derived from the theory of Lim (J. Mol. Biol, 1974, 88, 857-872). 
The method involves the drawing of the sequence of a theoretical A-helix 
where the hydrophobic residues form clusters. The shape, size and the 

io relative position of the clusters can be compared and the sequence 
similarity, when it exists, may be readily revealed. Conversion of the 
amino acid sequences into the 2D-helical plot required by the method 
was made using the HCA-Plot software. 
2) Results . 

15 In a first attempt to determine the xylanase sequence, the 

xylanatic activity has been tested in the genomic library. However this 
approach has failed to conduct to the isolation of the xylanase according 
to the present invention. 

In a second and successful attempt, the sequence of this 

20 xylanase has finally been determined. 

It is reminded that the amino acid sequence of the N-terminal 
region of the xylanase has been determined (example 4 of EP 
0.573.536). It exhibits 67% of identity with the N-terminus of the 
xylanases produced by Bacillus subtilis and Bacillus circulans. 

25 Besides, these enzymes which belong to the G family 

(according to the classification of Gilkes et al. (Microbiol. Rev., 1991, 55, 
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303-315)) share some conserved regions along their polypeptide chains. 
Among others, one region consisting of 7 amino acids occurs near the 
C-terminus. 

A part of the gene coding for the xylanase has been amplified 
5 by PCR using two degenerate primers, P1 and P2, corresponding to the 
N-terminus end of the xylanase and to a conserved region near the C- 
terminus, respectively. A 450 bp DNA fragment was obtained and 
cloned into the vector pBluescript. The sequence of the resultant 
plasmid pBX20 can be attributed without any doubt to the xylanase. To 
io get the complete gene of xylanase, a genomic library of B. sp 1-1017 was 
prepared in E. coli XL1-blue using the phage vector ZAP Express. This 
library was screened with the insert of the plasmid pBX20. One positive 
plaque, designated pBX52A2, was shown to contain the complete gene 
of the xylanase. 

15 The nucleotidic sequence of this clone is indicated in the 

sequence list hereunder as SEQ ID N° 1. 

The complete protein sequence of the xylanase is shown as 
SEQ ID N°2 is the sequence list hereunder. 

The results of a FASTA search in the protein data bases PIR 

20 and Swiss-Prot yielded 36 xylanase sequences. As shown in the table 
the xylanase shares sequence homology with other xylanases of the G 
family. The best scores (73% of identity) are observed as expected with 
the xylanases from B. subtilis and B. circulans. This shows 
unambiguously that the xylanase according to the present invention is a 

25 new protein which possesses a unique amino acid sequence. 
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For comparative purposes, only representative xylanases from 
different organisms (the ones in bold types on the table ) are listed in the 
multiple sequence alignment shown in figure 1. The analysis of a primary 
sequence alignment of 14 xylanases of the G family indicates the 
residues which are conserved throughout the family. As reported 
previously by Wakarchuk et al., (Protein ScL, 1988, 3, 467-475), 2 
glutamic acid residues are absolutely conserved in this family of 
xylanases. The present multiple alignment suggests that Glu76 and 
Gtu169 are the catalytic residues of the xylanase. 

Alignment was then reconsidered by the HCA method 
(Gaboriaud et al., 1987, previously cited), which allows for a rapid 
identification of the clusters and an easy alignment (figure 2). The 
identification of the clusters is straightforward even if there are some 
variations in cluster shapes. The alignment was checked on crystalline 
structures for three xylanases indicated in figure 1 , and the extensions of 
the B-strands are indicated on the HCA plots. Vertical lines have been 
inserted to delimit the extension of B-strands. The most conserved 
hydrophobic clusters have been shadowed for better visualization. 

For the HCA plot of xylanase according to the present invention 
(XYL20), the extension of the B-strands was deduced from the other 
plots. It appears clearly that the secondary structure of XYL20 consists 
essentially of B-strands and only one A-helix. These elements are so 
organized to display the characteristic folding of a greek key. We can 
also inferred that the following aromatic residues: Tyr 67 and Tyr 78 on 
one B-strand and Tyr 163 on another B-strand are likely to be involved in 
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the orientation and binding of xylan polysaccharides, prior their 
hydrolysis. 

The figure 3 summarizes the prediction of the occurrence of 
secondary structural elements which can be proposed for the xylanase 
5 according to the present invention on the basis of its primary structure 
and a thorough protein sequence analysis. These structural predictions 
can be translated into a putative three-dimensional model to be used in 
Molecular Isomorphism Replacement in view of solving the crystalline 
structure of this xylanase. 
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Table Summary of xylanases cerived from i KASTA se?.ich ir r he Protein Data Bases Swiss 
Prot (release 3 1 .0) and PIR (release 44.0). 



AC(1) 

PI 8429 

S39I57 

P09850 
SO 1 734 

P26220 
JS0591 

S47512 

JS0590 
P265I5 

S43919 

A44594 
A4-I595 

P36217 
S39I54 
S39S83 



date 
18.02.94 

07.06.90 

111.02.95 
14.07.94 

13.01.95 

10.0X94 
01.08.92 

25.10.y4 

14.06.94 
27.0G.94 

01.06.94 
06.03.94 
27.05.94 



origin 
Bin-Mux subtilis 

Bacillus circuluns 

Streptomyces lividans 

Streptomyces, sp ECJ 
Streptomyces lividans 

Humicula insnleus 
Trichoilenna viritle 

Tricltoderma reesei 



XYNA 



XYLC 



XYLB 
XYNB 



XYL II A 
XYLQB 

XYN2 
XYL I 
XYL n 



crvst % nb M\v 
a.a 

73 213 23,345 



* 73 213 23,345 



*0 240 25.673 



58 240 

58 333 

58 333 35.426 

55 227 

54 190 

54 190 

54 222 24,172 



P35809 
A44597 
S38973 

Q06562 

A44593 

PW1694 
WWBSXP 

JQ1935 
P33558 

PI7137 
S12745 

P33557 
S430I5 

JCII98 

P362IS 
$391 55 

548865 
S4952S 

$24754 
P29I27 

P55S 1 1 



111.06.94 
27.06.94 
18.02.94 

111.02.95 

27.06,94 

21.07.94 
20.09.84 

20.08.94 
01.02.95 

OI.OH.94 

30.09.93 

01.02.94' 

20.05.94 



01.06.94 
06.03.94 



30.09.93 
01.12.92 



Scluzophylbun conntmue 

Coch liuhtdus carbon mn 
Trichot lei ma harziwmm 
Bacillus pnmilus 

Clostridium stercariirimn 
Clostridium acetohutylictun 
Aspergillus uwamori 



05.03.93 Asfviyilfits niyer 



TtictuutctTua ivesei 



1 3 .0 1 .95 Seocallimastix frontalis 
:0.02.95 



SeiKtillinttisiix patricium 



0 1 .02.95 ftbmhacter suvchmgeues 



XYNA 



XYL A 



XYNC 



XYNC 

XYNI 
XYL2 



XYL A 
XYNA 



( 1 ) : AC : Accession Number 

(2) : tryst : crystal loyrapluc data available 

(3) : % : percentage ol* identity 

Only the sequences in bold have been used in Ok* alignment study, on figure 2 



54 197 20,978 

53 221 23,728 

53 190 

50 228 25,521 



48 511 

48 511 56.519 
46 261 29,032 



40 211 22.560 

40 211 

40 211 

39 229 24.583 



37 266 

35 607 

35 607 

34 607 66.175 

34 6U8 66.415 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: INSTITUT NATIONAL DE LA RECHERCHE 

AGRONOMIQUE(INRA) 

(B) STREET: 147, rue de I'Universite 

(C) CITY: PARIS 

(E) COUNTRY: FRANCE 

(F) POSTAL CODE (ZIP): 75341 

(G) TELEPHONE: 42 75 90 00 

(H) TELEFAX: 42 75 94 28 

(ii) TITLE OF INVENTION: XYLANASE, OLIGONUCLEOTIDIC SEQUENCE ENCODING 

IT AND ITS USES 

(iii) NUMBER OF SEQUENCES: 4 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 
APPLICATION NUMBER: US 08/543.956 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus sp 

(B) STRAIN: 1-1017 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATIONS. .547 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

AAC ACG TAC TGG CAG TAT TGG ACG GAT GGC ATC GGG TAT GTG AAC GCG 48 
Asn Thr Tyr Trp Gin Tyr Trp Thr Asp Gly He Gly Tyr Val Asn Ala 
1 5 10 15 
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ACG AAC GGA CAA GGC GGC AAC TAC AGC GTA AGC TGG AGC AAC AGC GGC 96 
Thr Asn Gly Gin Gly Gly Asn Tyr Ser Val Ser Tip Ser Asn Ser Gly 
20 25 30 

AAC TTC GTC ATC GGC AAG GGC TGG CAA TAC GGT GCG CAC AAC CGG GTT 144 
Asn Phe Val lie Gly Lys Gly Trp Gin Tyr Gly Ala His Asn Arg Val 
35 40 45 

GTC AAC TAC AAC GCC GGC GCA TGG CAG CCG AAC GGC AAC GCG TAT CTG 192 
Val Asn Tyr Asn Ala Gly Ala Trp Gin Pro Asn Gly Asn Ala Tyr Leu 
50 55 60 

ACG CTG TAC GGC TGG ACG CGC AAC CCG CTC ATC GAA TAC TAC GTC GTC 240 
Thr Leu Tyr Gly Trp Thr Arg Asn Pro Leu lie Glu Tyr Tyr Val Val 
65 70 75 80 

GAC AGC TGG GGC AGC TAC CGC CCG ACC GGC GAC TAC CGG GGC AGC GTG 288 
Asp Ser Trp Gly Ser Tyr Arg Pro Thr Gly Asp Tyr Arg Gly Ser Val 
85 90 95 

TAC AGC GAC GGC GCA TGG TAT GAC CTC TAT CAC AGC TGG CGC TAC AAC 336 
Tyr Ser Asp Gly Ala Trp Tyr Asp Leu Tyr His Ser Trp Arg Tyr Asn 
100 105 110 

GCA CCG TCC ATC GAC GGC ACG CAG ACG TTC CAA CAA TAC TGG AGC GTT 384 
Ala Pro Ser lie Asp Gly Thr Gin Thr Phe Gin Gin Tyr Trp Ser Val 
115 120 125 

CGT CAG CAG AAA CGC CCG ACG GGC AGC AAC GTC TCC ATC ACG TTC GAG 432 
Arg Gin Gin Lys Arg Pro Thr Gly Ser Asn Val Ser He Thr Phe Glu 
130 135 140 

AAC CAC GTG AAC GCA TGG GGC GCT GCC GGC ATG CCG ATG GGC AGC AGC 480 
Asn His Val Asn Ala Trp Gly Ala Ala Gly Met Pro Met Gly Ser Ser 
145 150 155 160 

TGG TCT TAC CAG GTG CTC GCA ACC GAA GGC TAT TAC AGC AGC GGA TAC 528 
Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Tyr Ser Ser Gly Tyr 
165 170 175 

TCC AAC GTC ACG GTT TGG TAA 549 
Ser Asn Val Thr Val Trp 
180 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
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Asn Thr Tyr Trp Gin Tyr Trp Thr Asp Gly (le Gly Tyr Val Asn Ala 
1 5 10 15 

Thr Asn Gly Gin Gly Gly Asn Tyr Ser Val Ser Trp Ser Asn Ser Gly 
20 25 30 

Asn Phe Val Me Gly Lys Gly Trp Gin Tyr Gly Ala His Asn Arg Val 
35 40 45 

Val Asn Tyr Asn Ala Gly Ala Trp Gin Pro Asn Gly Asn Ala Tyr Leu 
50 55 60 

Thr Leu Tyr Gly Trp Thr Arg Asn Pro Leu He Glu Tyr Tyr Val Val 
65 70 75 80 

Asp Ser Trp Gly Ser Tyr Arg Pro Thr Gly Asp Tyr Arg Gly Ser Val 
85 90 95 

Tyr Ser Asp Gly Ala Trp Tyr Asp Leu Tyr His Ser Trp Arg Tyr Asn 
100 105 110 

Ala Pro Ser He Asp Gly Thr Gin Thr Phe Gin Gin Tyr Trp Ser Val 
115 120 125 

Arg Gin Gin Lys Arg Pro Thr Gly Ser Asn Val Ser lie Thr Phe Glu 
130 135 140 

Asn His Val Asn Ala Trp Gly Ala Ala Gly Met Pro Met Gly Ser Ser 
145 150 155 160 

Trp Ser Tyr Gin Val Leu Ala Thr Glu Gly Tyr Tyr Ser Ser Gly Tyr 
165 170 175 

Ser Asn Val Thr Val Trp 
180 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic.degenerate 
oligonucleotide" 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AAYACNTAYT GGCARTAYTG GACNGAYGG 29 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic.degenerate 
oligonucleotide" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
YTGWCKNACR CTCCARTAYT G 21 
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CLAIMS: 

1. Xylanase having a sequence, sharing an homology of at 
least 80%, and preferentially 90%, with the one having the following 

5 sequence SEQ ID N°2: 

Asn Thr Tyr Trp Gin Tyr Trp Thr Asp Gly lie Gly Tyr Val Asn Ala Thr 
Asn Gly Gin Gly Gly Asn Tyr Ser Val Ser Trp Ser Asn Ser Gly Asn 
Phe Val He Gly Lys Gly Trp Gin Tyr Gly Ala His Asn Arg Val Val Asn 

io Tyr Asn Ala Gly Ala Trp Gin Pro Asn Gly Asn Ala Tyr Leu Thr Leu 
Tyr Gly Trp Thr Arg Asn Pro Leu He Glu Tyr Tyr Val Val Asp Ser Trp 
Gly Ser Tyr Arg Pro Thr Gly Asp Tyr Arg Gly Ser Val Tyr Ser Asp Gly 
Ala Trp Tyr Asp Leu Tyr His Ser Trp Arg Tyr Asn Ala Pro Ser He Asp 
Gly Thr Gin Thr Phe Gin Gin Tyr Trp Ser Val Arg Gin Gin Lys Arg 

15 Pro Thr Gly Ser Asn Val Ser He Thr Phe Glu Asn His Val Asn Ala Trp 
Gly Ala Ala Gly Met Pro Met Gly Ser Ser Trp Ser Tyr Gin Val Leu Ala 
Thr Glu Gly Tyr Tyr Ser Ser Gly Tyr Ser Asn Val Thr Val Trp 

2. Xylanase according to claim 1, which is stable at 
approximately 60 °C for 24 hours. 

20 3. Xylanase according to one of the claims 1 and 2, which is 

secreted by a microorganism strain, appropriately chosen, transformed 

by a vector encoding the said xylanase. 

4. Nucleotidic sequence coding for the xylanase according to 

one of the claims 1 to 3. 
25 5. Nucleotidic sequence having the following sequence SEQ ID 

N°1: 

aacacgtactggcagtattggacggatggcatcgggtatgtgaacgcgacgaacggaca 
aggcggcaactacagcgtaagctggagcaacagcggcaacttcgtcatcggcaagggct 
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ggcaatacggtgcgcacaaccgggttgtcaactacaacgccggcgcatggcagccgaa 
cggcaacgcgtatctgacgctgtacggctggacgcgcaacccgctcatcgaatactacgt 
cgtcgacagctggggcagctaccgcccgaccggcgactaccggggcagcgtgtacagc 
gacggcgcatggtatgacctctatcacagctggcgctacaacgcaccgtccatcgacggc 
5 acgcagacgttccaacaatactggagcgttcgtcagcagaaacgcccgacgggcagcaa 
cgtctccatcacgttcgagaaccacgtgaacgcatggggcgctgccggcatgccgatgg 
gcagcagctggtcttaccaggtgctcgcaaccgaaggctattacagcagcggatactcca 
acgtcacggtttggtaa 

6. Vector, in particular plasmid, comprising a sequence 
io according to one of the claims 4 and 5. 

7. Process for the production of a xylanase having the 
sequence SEQ ID N°2, or sharing an homology of at least 80% and 
preferentially 90%, with SEQ ID N°2 wherein: 

- a microorganism strain appropriately chosen and transformed 
15 by a vector encoding the said xylanase according to claim 6 is grown in 

an appropriate medium, and 

- the xylanase is isolated. 
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F1G.1 



1/5 



50 



xyl20 
Bac sub 
Bac cir 
Str liv 
Str ec3 
Hum ins 
Tri vir 
Tri ree 
Sch com 
Coc car 
Bac pum 
Clo ste 
Clo ace 
Asp awa 



xy!20 
3ac sub 
3ac cir 
Str liv 
Str ec3 
Hum ins 
Tri vir 
Tri ree 
Sch com 
Coc car 
Bac pum 
Clo ste 
Clo ace 
Asp awa 



MFKFKKNF LVG...LSAA 

MFKFKKNF LVG. . . L5AA 

. MQQDGT QQDRIKQSPA PLNGMSRRGF LGGAGTLALA 
. MQQDGK RQDQNQQIJPA PFSGLSRRGF LGGAGTVALA 
MV SLKSVLAAAT AVSSAIAAPF 



.MV SFTSLLAASP PSR ASC 



MV SFTSIITAAV AATGALAAP. 

MNLRKLRLLF VMCIGLTLIL TAV 

MKRKVK KMAAMATSII MAIMIILKSI 

MLRRKV IFTVLATLVM TSLTIVDNTA FAATNLNTTE STFSKEVLST 

MKVT 



51 



LMSISLFSAT 
LMSISLFSAT 
TASGLLLPGT 
TASGLLLPST 
DFVPRDNSTA 



RPAAEVESVA 



acyl20 
Bac sub 
Bac cir 
Str liv 
Str ec3 
Hum ins 
Tri vir 
Tri ree 
Sch com 
Coc car 
3ac pum 
Clo ste 
Clo ace 
Asp awa 



ATDVS 

P 

PV 

QKTYSAFNTQ 
AASAGLLGHA 

101 

VSWS.N.SGN 
VNWS . N . TGN 
VNWS. N. TGN 
TQWT . N . CGN 
TQWT.N.CGN 
VRWR . H . TGN 
VNWS.N.SGN 
VNWS.N.SGN 
LTWSGN . NGN 
VSWG.S.GGN 
AGWN.N.IGN 
CQWS.N.IGN 
CQWS . N'. IGN 
MWEDGVSSD 



NT. . . 

ASAASTD. . . 
ASAASTD. . . 
AHAATTITTN 
.^HAATTITTN 
LQARQVTPNA 

QTIGPG 

VEKRQTIQPG 

SGTPSS 

LVARQNTPNG 
AHARTITNNE 
LAGRIIYDNE 
AAPKTITSNE 
FAAPVPQPVL 



YWQ 

YWQ 

YV7Q 

QTGTDGMYYS 
QTGYDGMYYS 
EGWHNGYFYS 
TGFNNGYFYS 
TGYNNGYFYS 
TGTDGGYYYS 
EGTHNGCFV75 
MGNHSGYDYE 
TGTHGGYDYE 
IGVNGGYDYE 
VSRSAGINYV 



YWTDGIGYVN 

NWTDGGGIVN 

NWTDGGGIVN 

FWT DGGGSVS 

FWTDGGGSVS 

WWSDGGGQVQ 

YWNDGHGGVT 

YWNDGHGGVT 

V7WTDGAGDAT 

WWSDGGARAT 

LWKD . YGNTS 

LWKD. YGNTI 

LWKD. YGNTS 

QNYNGNLADF 



100 
ATNGQGGNYS 
AVNGSGGNYS 
AVNGSGGNYS 
MTLNGGGSYS 
MTLNGGGSYS 
YTNLEGSRYQ 
YTNGPGGQFS 
YTNGPGGQF5 
YQNNGGGSYT 
YTNGAGGSYS 
MTLNNGGAFS 
MELNDGGTFS 
MTLKNGGAFS 
TYDESAGTFS 



FVIGKGWQYG 
FWGKGWTTG 
FWGKGWTTG 
FVAGKGWSTG 
FVAGKGWGNG 
FVGGKGWNPG 
FVGGKGWQPG 
FVGGKGWQPG 
LVGGKGWNPG 
LVGGKGWNPG 
ALFRKGKKFD 
ALFRKGRKFN 
ALFRKGKKFN 
FWGLGWTTG 



150 

N RWNYN. .AG AWQPNGNAYL 

F RTINYN. .AG VWAPNGNGYL 

...F RTINYN.. AG WAPNGNGYL 

GNVRYN...G YFNPVGNGYG 

RTVRY5...G YFNPSGNGYG 

...G RTINYG...G YFNPQGNGYL 
. - - N KVINFS . . . G TYN PNGNSYL 
...N KVINFS... G SYNPNGNSYL 
...S RSISYS...G TYQPNGNSYL 
. . .A RTITYS . . . G TYNYNGNSYL 
STRTHHQLGN ISINYN. .AS . FNPSGNSYL 
SDKTYQELGD IWEYG. . CD . YN PNGNSYL 
DTQTYKQLGN ISVNYD..CN . YQPYGNSYL 
S S NAISYS..AE YSASGSSSYL 



AH. 
SP. 
SP. 
D. . 
GR. 
T. . 
TK. 
TK. 
AA. 
T. . 
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fig.l (cont) 2/5 

151 * 200 

xyl20 TLYGWTRNPL IEYYWDSW. GSYRPT GDYRGSVYSD GAWYDLYHSW 

Sac sub TLYGWTRSPL IEYYWDSW. GTYRPT GTYKGTVKSD GGTYDIYTTT 

Bac cir TLYGWTRSPL IEYYWDSW. GTYRPT GTYKGTVKSD GGTYDIYTTT 

Str liv CLYGWTSNPL VEYYIVDNW. GSYRPT GTYKGTVSSD GGTYDI YQTT 

Str ec3 CLYGWTSNPL VEYYIVDNW. GSYRPT GEYRGTVYSD GGTYDIYKTT 

Hum ins AVYGWTRNPL VEYYVIESY. GTYNPGS . . Q AQYKGTFYTD GDQYDIFVST 

7ri vir SVYGWSRNPL IEYYIVENF. GTYNPST. . G ATKLGEVTSD GSVYDIYRTQ 

Tri ree SVYGWSRNPL IEYYIVENF. GTYNPST. . G ATKLGEVTSD GSVYDIYRTQ 

Sch com SVYGWTRSSL IEYYIVESY . GSYDPSS..A ASHKGSVTCN GATYDILSTW 

Coc car AVYGWTRNPL VXYYWENF. GTYDPSS . . 0 SQNKGTVTSD GSSYKIAQST 

Bac pum CVYGWTQSPL AEYYIVDSW. GTYRPTG..A Y.KGSFYA.D GGTYDIYETT 

Clo see CVYGWTRNPL VEYYIVESW. GSWRPPG..A TPKGTITQWM AGTYEIYETT 

Clo ace CVYGWTSSPL VEYYIVDSW. GSWRPPG..G TSKGTITV.D GGI YDIYETT 

Asp awa AVYGWVNYPQ AEYYIVEDY . GDYNPCS . . S ATSLGTVYSD GSTYQVCTDT 
201 250 

xyl20 RYNAPSIDGT Q.TFQQYWSV RQQKRPTGS . . NVSITFENH VNAWGAAGMP 

Bac sub RYNAPSIDGD RTTFTQYWSV RQSKRPTGS . . NATITFSNH VNAWKSHGMN 

bac cir RYNAPSIDGD RTTFTQYWSV RQSKRPTGS. . MATITFTNH VNAWKSHGMN 

Str liv RYNAPSVEGT K. TFQQYWSV RQSKVTSGS. . . GTITTGNH FDAWARAGMN 

Str ec3 RYNAPSVEGT R . TFDQYWSV RQSKVI.GS. ..GTITTGNH FDAWARAGMN 

Hum ins RYNQPSIDGT R.TFQQYWSI RKNKRV GGSVNMQNH FNAWQQHGMP 

Tri vir RVNQPSIIGT S . TFYQYWSV RRTHRS SGSVNTANH FNAWAQQGLT 

Tri ree RVNQPSIIGT A. TFYQYWSV RRNHRS SGSVNTANH FNAWAQQGLT 

Sch com RYNAPSIDGT Q.TFEQFWSV RNPKKAPGGS ISGTVDVQCH FDAWKGLGMN 

Coc car RTNQPSIDGT R . TFQQYWSV RQNKRS SGSVNMKTH FDAWASKGMN 

Cac pum RVNQPSIIGI . ATFKQYWSV RQTKRT SGTVSVSAH FRKWESLGMP 

Clo ste RVNQPS I DGT .ATFQQYWSV RTSKRT . SGTISVTEH FKQWERMGMR 

Clo ace RINQPSIQGN . TTFKQYWSV RRTKRT SGTISVSKH FAAWESKGMP 

Asp awa RTNEPSITGT S.TFTQYFSV RESTRT SGTVTVANH FNFWAQHGFG 

2S1 " * 293 

acyl20 MGSSWSYQVL ATEGYYSSGY SNVTVW 

3ac sub LGSNWAYQVM ATEGYQSSGS SNVTVW 

Bac cir LGSNWAYQVM ATEGYQSSGS SNVTVW 

Str liv MGQFRYYMIM ATEGYQSSGS SNITVSG 

Str ec3 LGQFQYYMIM ATEGYQSSGS SNITVSG 

Hum ins LG.QHYYQW ATEGYQSSGE SDIYVQTH 

Tri vir LG . TMDYQI V AVEGYFSSGS ASITVS 

Tri ree LG. TMDYQI V AVEGYFSSGS ASITVS 

Sch com LGSEHNYQIV ATEGYQSSGT ATITVT 

Coc car LG.QHYYQIV ATEGYFSTGN AQITVNCP 

Bac pum MGKMYE.TAF TVEGYQS SGS ANVMTNQLFI GN 

Clo ste MGKMYE . VAL TVEGYQSSGY ANVYKNEIRI GANPTPAPSQ SP* 

Clo ace LGKMHE. TAF NIEGYQSSGK ADVNSMSINI GK 

Asp awa . NSDFNYQVM AVEAWSGAGS ASVTISS 

Bnc sub = Bacillus sub ti lis (AC. PI 8429). Bac cir ■ Bacillus circulans (AC. P09850), Sir liv 
= Strcpiomvccs fividans (AC. P26220). Sir ec3 » Stnspttunyecs sp. EC3 (AC. S47512). Hum ins « Humicofa 
insalcns (AC. S43919). Tri vir ■ Triclunhrma viridc (AC. A44594). Tri rcc » Trichoderma reesti (AC. 
P362I7). Sch com «= Sduzophyllum commune (AC. P3580V). Coc car - Cochliobolus carbonum (AC. 
Q065G2). Bac pum « Bacillus pnmi/us (AC. P0061M). Clo Sic 88 Clostridium stcrcorarum (AC. JQ1935), Clo 
acc - Clostridium acetobutylicum (AC. P 1 7 1 37). Asp awn e Aspergillus awamori (AC. P33557). 
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